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^ (54) Title: TREATMENT OF CANCER 

^ (57) Abstract: The product of a gene located at human chromosome 21q1 1-21 has ubiquitin specific protease activity and binds 
to ubiquitin, polyubiquitin, ubiquiUn-like protein SUMO-3, and proteins which are implicated in the repair/excision of DNA. It is 
Q believed to have a role in regulating cell growth, cell growth arrest and/or apoptosis and is impUcated in carcinoma growth. The 
^ gene has been sequenced and cloned into a microorganism and the product's efifect on cell growth is to be investigated. The gene 
>^ has GenBank accession number AF 1342 13 and has the HUGO-approved name USP25. 
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Treatment of Cancer 

The present invention relates to the use of the product of the USP25 
gene, which appears to have ubiquitin specific protease activity and may 
have protease activity on ubiquitin-like proteins, located at human 
5 chromosome 21 q1 1 in the treatment, prophylaxis or diagnosis of cancer, 
especially solid tumours, more particularly lung cancer. 

Ubiquitin mediated proteolysis by the 26S proteasome is responsible 
for the physiological regulation of the levels of many proteins which are key 
cell cycle and cell growth regulators (Hochstrasser 1995), as well as known 

10 tumour suppressors (Lane 1998). 

Ubiquitin is a 76 amino acid polypeptide present in all eukaryotic 
cells, and highly conserved in evolution. Ubiquitin is conjugated to a target 
protein through an isopeptide bond between the e-amino group of Lys in a 
target protein (ubiquitination), a process mediated by three groups of 

IS enzymes: ubiquitin activating enzymes (El), ubiquitin conjugating enzymes 
(E2) and ubiquitin ligases (E3). Ubiquitinated proteins exist in a 
monoubiquitinated form, or a multiublquitin chain: the former is not a 
degradation signal, while the latter, Lys-48-linked ubiquitln-ubiquitin(n) 
conjugate, works as a strong degradation signal when joined to a Lys in a 

20 target protein. Protein conjugated to polyubiquitin is then very rapidly and 
efficiently degraded by non-lysosomal, ATP-dependent degradation by the 
26-S proteasome. The de-ubiquitinating enzymes (DUB's) broadly fall into 
two classes; ubiquitin specific proteases (USPs) and ubiquitin-C-terminal 
hydrolases (UCHs) are each capable of de-conjugating the ubiquitin- 

25 ubiquitin and ubiquitin-protein links, thereby converting polyubiquitin into 
mono-ubiquitin. and de-coupling (deubiquitinating) ubiquitin from the target 
protein, with the result of preventing the degradation of the target protein. 

The class of UCH enzymes tends to include relatively small proteins 
(about 35-40kD) which have low specif ity for ubiquitinated proteins. BAP1 is 

30 a UCH but is unusual in that it is larger, having a weight of nearly lOOkD. 
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Several such enzymes are known, the sequences of which show some 
sequence homologies especially in two domains, the Cys and His domains. 

Very broadly, two main functions have been observed among the 
various members of the USP (UBP) superfamily (Wilkinson 1997, Wilkinson 
5 and Hochstrasser 1998). The first is the generation of free ubiquitin from 
precursor fusion proteins or from peptide-linked polyubiquitin after 
proteolysis of the targeted protein, and the second is de-ubiquitination. 
When a protein is targetted for ubiquitin mediated degradation, it is linked to 
ubiquitin via an isopeptide bond between the C-terminus of ubiquitin and a 

10 lysine t-amino group(s) of the acceptor protein. Once the conjugate is 

formed, it can have only two fates: non-lysosomal proteolysis mediated by 
the 26S proteasome resulting in total protein degradation, or de-conjugation 
from ubiquitin (de-ubiquitination), resulting in the rescue of the target protein 
from degradation (Wilkinson 1997, Wilkinson and Hochstrasser 1998). 

15 The regulation of p53 levels by USP25, and consequential prescribed 

increased cellular predisposition to apoptosis (programmed cell death) upon 
overexpression of USP25, could be a mechanism by which neuronal loss 
occurs in Alzheimer's Disease associated with Down Syndrome. Some 
mechanism could be invoked in explaining the sporadic AD if overexpression 

20 of USP25 occurs through a somatic change in aging neurons. 

The potential USP catalysed de-ubiquitination and rescue from 
degradation of tumour suppressors such as p53, could perhaps explain the 
link to deletions of USPs in solid tumours. De-ubiquitination could play a 
major role in the Mdm2 mediated control of p53 levels and its activation 

25 mechanism, since the ubiquitin-mediated proteasome degradation of p53 is 
an important effector arm of this recently revealed control pathway (Haupt et 
al., 1997, Kubbutat et aL, 1997, Lane 1998). Halving the dose of a USP 
could give a pre-cancerous cell a selective advantage in proliferation, by 
diminishing its rate of "re-cycling" of p53, making it more difficult to achieve a 

30 threshold concentration of p53. necessary for its activation. The same 
hypothetical, gene-dose regulated, tumour suppression model could also 
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explain why trisomy 21 (Down's syndrome DS) seems to confer protection 
from development of solid tumours. Among DS children, neuroblastomas and 
nephroblastomas, and among DS adults, gynaecologic, digestive and breast 
cancers have very rarely been reported, and are significantly under- 
5 represented compared to the age-matched euploid population (Oster et a!., 
1975, Satge etaL, 1998). 

Baker et al, 1999 also propose a role for USP's in tumour formation 
and cell growth. 

In recent years a number of other protein modifying polypeptide tags 

10 have been identified. Many of these are related to ubiquitin and have high 
levels of identity and similarity (determined using the BLAST algorithm, for 
instance) to ubiquitin itself. There is a recognised super family of such 
proteins which have been termed ubiquitin-like proteins (UbL) (Gong et al. 
1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO-1 (PIC1, 

15 Sentrin, hSmt3C). SUMO-2 (hSmtSA) and SUMO-3 (hSMTSB) belong to the 
same family of UbL proteins with approximately 50% identity between 
themselves, and some 15-30% identity and 40-60% similarity in amino acid 
sequence to ubiquitin (Lapenta et al. 1997. Mannen et al, 1996, Kamitani et 
al. 1998, Saitoh and Hinchey, 2000). Yeast and human UBC9 are capable of 

20 conjugating equally yeast or human UbL-s, but not ubiquitin (Schwarz et al. 
1998), The SUMO-1, -2 and -3 have the C-terminal glycine, necessary for 
ubiquitination of the target protein's lysine residue, but unlike ubiquitin, do 
not have the Lys48 residue necessary for the formation of polyubiquitin 
chains through isopeptide bonds, which are the signal for the proteasome 

25 degradation (Saitoh and Hinchey, 2000). Nevertheless, yeast Smt3 protein 
can rescue the mutant Mif2 phenotype, a deficient centromere binding 
protein resulting in chromosome missegregation (Meluh and Koshland 
1995). SUMO-1, as well as SUMO-3 (and probably also SUMO-2) are all 
capable of being attached by UBC9 to RanGapl , a Ran GTP-ase activating 

30 protein (Kamitani et al.. 1998), This ATP-dependent attachment is essential 
for the binding between modified RanGapl and RanBP2 binding protein, in 
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order to form functional nuclear pore complex, which controls export ancf 
import of molecules through the nuclear envelope (Mahajan et al. 1997. 
Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins have 
been shown to modify the death domains of Fas (Okura et al. 1996), Tumour 
5 necrosis factor receptorl (Okura et al. 1996), PML (a tumour suppressor 
implicated in the pathogenesis of acute promyelocytic leukaemia) (Kamitani 
et al. 1998b) and Rad51/52 DNA repair proteins (Shen et al. 1996a). Their 
conjugating enzyme, UBC9, has been shown to interact by Y2H technique 
with RAD51/52 DNA repair proteins, and the master tumour suppressor p53 

10 (Shen et al. 1996b). Another UbL is NEDD8 (Kamitami et a/ 1997). 

UbL's are conjugated and cleared from their targets by enzymes. 
Several UbL hydrolase enzymes have been identified which convert 
precursor UbL to active UbL. Some such enzymes interact with ubiquitin 
itself as well as with other UbL's. Proteases involved in cleavage of 

15 conjugates of UbL with target protein have been identified for instance 

SENP1 and SUSP-1. which were recently cloned (Kim et al. 2000, Gong et 
al. 2000a), and found to specifically cleave SUMO-1,-2 and -3, but not 
ubiquitin and NEDD8. The first human enzyme with classical USP structure 
(Cys. His domains) for which dual specificity to both ubiquitin and ubiquitin 

20 like protein was demonstrated was very recently published USP21 on 

chromosome 1q21 (Gong et al. 2000b). However, opposite from SENP1 and 
SUSP-1, this enzyme cleaves ubiquitin and Nedd8, but not SUMO-1,-2 or -3 
(Gong et al. 2000b). 

The proximal third of the chromosome 21 long arm is an exceptionally 

25 gene-poor region of the human genome as estimated by a number of criteria 
(Shimizu et al., 1995, Yaspo et al.. 1995, Gardiner 1996) the estimates of 
gene-density range from one gene in a megabase to one gene in six 
megabases of genomic DNA. Until recently only three full length genes had 
been mapped in this region: STCH (a member of the hsp70 family) (Brodsky 

30 et al., 1995), RIP140 (protein functionally interacting with a variety of nuclear 
receptors such as estrogen receptor), (Cavailles et al., 1995) and ANA (a 
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member of the Tob/BTG1 family of tumour suppressors), (Kohno et al., 
1998). This region is also an example of extremely highly methylated regions 
in the human genome. 

Groet et al. (1998) describe a high-resolution bacterial contig map of 
5 3.4 Mb of genomic DNA in human chromosome 21q11-q21, encompassing 
the region of elevated disomic homozygosity in Down's syndrome • 
associated abnormal myelopoiesis jnd leukemia, and which has shown a 
strong association with Alzheimer's disease (AD). It was suggested that the 
high resolution bacterial clone overlap map should be the basis for deriving 

10 a more complete transcriptional map of that region of the chromosome. It 
was hoped that this would lead to an explanation of the chromosome 21 q1 1 
linkage in familial early onset AD (FEOAD) families. In particular it was 
suggested that a modifier gene in that region could act together with the 
presenilin-1 gene to generate or modify the AD phenotype. 

15 Further work by the present inventors has revealed a new gene in the 

proximal third of chromosome 21 and that the product of this gene has 
ubiquittn specific protease properties. It is postulated that the gene product 
and USPs generally may have a role in AD. The work has been published in 
Groet et al (2000) after the first priority date of the present application. 

20 Valero, et al. (1999) published after the first priority date of the 

present application, have, in parallel identified this gene and pointed out the 
gene product's sequence homologies to known USP's in the conserred 
peptide domains previously identified e.g. by d'Andrea et al (1998). They 
postulate a role in Alzheimer's disease. This protein has the HUGO 

25 approved name USP25. 

According to the present invention there is provided Wie new use of 
the product of the USP25 gene located at human chromosome 21q 1 1-21 , a 
non-human homologue thereof or a functional fragment thereof in the 
manufacture of a medicament for use in the diagnosis, prophylaxis or 

30 treatment of cancer. 
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The cancer is usually a solid cancer, most often non-small cell lung 
carcinoma, or skin cancer. 

A functional fragment herein means a protein having ubiquitin specific 
protease activity or UbL specific protease activity and comprising a portion 
5 with sequence homology with the product of the said USP25 gene. 

It is believed that the ubiquitin specific protease activity of the protein 
having sequence ID 1 is responsible for its implication in the pathogenesis of 
cancer. USP activity may be determined using the technique described in 
the Examples elow. in which using bacteria cotransformed with the USP 
10 gene and with a reporter gene encoding a fusion protein which is a ubiquitin- 
conjugated detectable protein. The protein may be an enzyme detectable by 
direct enzyme reaction, by enzyme-linked immune assay techniques, by 
autoradio-graphically or by direct staining after gel separation under 
conditions suitable to separate ubiquitin and cleaved protein from fusion 
15 protein. 

From experiments conducted to determine with which proteins the 
product having sequence ID 1 interacts, we have found that there is 
interaction with ubiquitin. polyubiquitin and various ubiquitin precursors, as 
well as HHR23A (Matsutani et a/ 1994. GenBank Accession No D21235). 

20 There is also interaction with other ubiquitin-like proteins and with proteins 
which are known to interact with ubiquitin-like proteins, such as Sumo-3 
(Mannen et al 1996. Kamitani ef a/ 1996, Saitoh et al 2000, GenBank 
Accession No. NM 006937) and ubiquitin-like-specific conjugating enzyme 9 
(Schwarz et al 1998, Lee et a/ 1998, GenBank Accession No. U 66867). The 

25 isolated protein may therefore be characterised further by having a positive 
interaction in a yeast-two-hybrid procedure with one or more, preferably all 
three, proteins having the sequences of GenBank Accession Nos. D 21235. 
NM 006937 and U 66867. Ubiquitin-like specific protease activity may be 
determined using techniques analogous to those used to determine ubiquitin 

30 specific protease activity, by using a substrate which is a fusion protein of 
the ubiquitin-like protein of interest and a detectable protein, and using the 
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usual separation and immune based or autoradiographic identification 
techniques. 

The gene may be transcribed and translated within a cell line selected 
as positive for the native gene or an active (in terms of ubiquitin or UbL 
5 specific protease activity) fragment thereof. The gene has preferably been 
introduced into a microorganism or a cell line in a form in which it can be 
transcribed and translated and the microorganism or the celNine, as the 
case may be, has been cultured under conditions whereby the gene is 
replicated during cell division, and is transcribed and translated into ubiquitin 

10 specific protease and the USP is recovered. Preferably the gene in the 
microorganism is recombinant DNA derived from the mRNA from cells 
having the active USP gene. Suitably the gene includes sequence ID No. 7, 
more preferably sequence ID No. 6. 

According to a further aspect of the invention there is provided a new 

15 use of a protein product having Cys, QQD and His domains specified in 
sequence IDs numbers 2, 3 and 4, respectively, in the manufacture of a 
medicament for use in the diagnosis, prophylaxis or treatment of cancer. 
The protein has ubiquitin specific protease or ubiquitin-like specific protease 
activity, and has the three specified domains in the USP or UbLSP active 

20 conformation. Preferably the protein has, outside the specified domains, 
some level of sequence homology with sequence ID 1, for instance at least 
20%, preferably at least 50%, identity with that sequence, and a level of 
similarity of at least 50%, preferably at least 70% or more with that sequence 
(in each case determined using, for instance the BLAST algorithm). 

25 Preferably the protein has sequence I.D.1 or sequence LD.5. Homologues, 
such as the corresponding mouse product, described in Valero, et al 1999 
may be used or sequences which have the above levels of identity and 
similarity with such a sequence. 

According to a preferred embodiment the protein has sequence LD.1. 

30 According to another preferred embodiment the protein has sequence 

I.D.5. 
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According to a further aspect of the invention there is provided an in 
vitro method in which mammalian cells are cultured in the presence of the 
product of the USP25 gene located at human chromosome 21 q1 1 or of a 
homologue thereof, or a functional fragment of said product. 

According to a further aspect of the invention there is provided an in 
vitro method in which mammalian cells are cultured in the presence of a 
protein product having Cys. QQD and His domains specified in sequence 
IDs numbers 2. 3 and 4, respectively. Preferably the protein has, outside the 
specified domains, some level of sequence homology with sequence ID 1, 
for instance at least 20%, preferably at least 50%. identity with that 
sequence, and a level of similarity of at least 50%, preferably at least 70% or 
more with that sequence (in each case determined using, for instance the 
BLAST algorithm). Preferably the protein has sequence LD.1 or sequence 
I.D.5. Homologues, such as the corresponding mouse product, described in 
Valero, et al 1999 may be used or sequences which have the above levels of 
identity and similarity with such a sequence. 

In these in vitro methods the effect of the protein on cell growth, cell 
growth arrest and/or apoptosis is assessed. 

The microorganism containing the specified gene has preferably been 
transformed with a vector comprising at least a portion of sequence I.D,6 
comprising residues 199 to 3367, optionally including an additional 96 b.p. 
exon inserted after base 2356, and optionally including 5' and/or 3' 
untranslated regions (UTR's). The vector preferably comprises 
transcriptional and translational control sequences. The microorganism may 
be a yeast but is most conveniently a bacterium. 

A mammalian cell transfected with a vector preferably comprises 
exons making up a sequence comprising residues 199 to 3367 of sequence 
I.D.6 and optionally also regulatory genomic sequences which are present in 
wild-type chromosome which lead to enhanced expression activity. 
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The sequence from residues 199 to 3367 of sequence ID no 6 is ' 
specified as sequence ID7 and consists of start codon (ATG), an open 
, reading frame of 3165 nucleotides, and a stop codon (TAA), 

According to a further aspect of the invention there is provided a new 
5 use of DNA including the gene located at human chromosome 21 q 1 1 -21 or 
a fragment thereof encoding a functional USP product in the manufacture of 
a medicament for use in the prophylaxis or treatment of cancer. The DNA is 
preferably incorporated in a gene therapy vehicle, and is preferably part of a 
vector, for instance a plasmid vector, or viral vector, capable of transfecting 
10 cells in vivo. The medicament usually includes a pharmaceutically 
acceptable carrier. 

The invention is illustrated further in the accompanying drawings in 

which: 

Figure 1 represents a map of human chromosome 21 
15 Figure 2 represents a map showing the location of exon trapped 

products from the experiments reported below 

Figure 3 represents sequence homologies of USPs 
and 

Figure 4 represents the results of the experiment illustrating USP 
20 properties reported below. 

The following specific description describes the work which has been 
carried out. 

Experimental 

We identify a portion of human chromosome 21 homozygously 
25 deleted in non-small cell non carcinoma (NSCLC) for further study. The 

region contained the DNA marker with the highest NSCLC-associated loss of 
homozygosity (LOH), reported by Kohno et aL We found a shared region of 
overlap (SRO) for the hemizygous loss in other NSCLC. The current work is 
to identify genes in the SRO v/hich have a potential role in tumour 
30 suppression. 
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A total of 42 fresh NSCLC cases have been analyzed from the 
Croatian Tumour Bank (CTB), an initiative with set rules and criteria for 
accumulation of fresh clinical tumor specimens for molecular studies 
(Spaventi et al., 1 994). Of these, half (including tumors #47 and #61 ) were 
5 samples that were recently studied for LOH of the NM23-H1 gene (Bosnar et 
ai, 1997), and the other half were fresh tumors (data obtainable from CTB, 
which also lists the tumor stage in the TNM system, grade, size and survival 
data), in each case, the tumors and normal lung tissue specimens (as 
evaluated by the surgeon) were frozen in liquid nitrogen in the operating 

10 room and further stored at -70**C Genomic DNA was isolated using standard 
procedures (Sambrook et ai, 1989). For each sample, 4|jm serial frozen 
sections were cut, mounted on glass slides, and stained with hematoxylin- 
eosin (H&E). A pathologist confirmed the histologic type of the tumor and 
evaluated the percentage of normal cells within the tumor. Only samples 

15 with less than 20% non-tumor cells were used in this study. 
Markers and LOH Analysis 

Microsatellite analysis was performed using polymerase chain 
reaction (PCR) with appropriate primer pairs (sequences and PGR 
conditions as in Genome Data Base. Johns Hopkins University, Baltimore, 

20 MD), where the forward primer only from each pair was 5' fluorescently 

labeled with Applied Biosystems (ABl; Foster City. CA) Big Dyes^ (6-FAM 
or HEX). Amplification products were analysed using an ABl 310 Genetic 
Analyzer. Size standards (GeneScan 350) were mixed with every sample for 
accurate sizing; the separation of the mixture of denatured fragments was 

25 achieved by electrophoresis through a 47 cm capillary (module GS STR 
P0P4 C) for approximately 30 min. Raw data were analyzed using 
GeneScan and Genotyper software. LOH ratios were calculated exactly as 
described in the GeneScan Applications manual provided by ABl. For each 
individual allele's fluorescence level, an average of 3 independent 

30 electrophoresis-analysis cycles on the ABl 310 was used for calculation. 
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Fluorescence In Situ Hybridization (FISH) 

Unstained 4[jm-thick paraffin block sections were fixed to glass slides, 
and a standard pretreatment protocol was followed for formalin-fixed, 
paraffin-embedded slides. P1 -derived artificial chromosome (PAC)DNA was 

5 labelled with digoxygenin-1 1 -dUTP (Boehringer Mannhein, Germany). 
Approximately 0.5 fjg of each labelled PAC DNA sample was mixed with 5 
pg of Cot1 DNA (Gibco BRL, Gaithersburg. MD), precipitated, denatured, 
allowed to preanneal. and then applied to a denatured slide and hybridized 
overnight. Slides were washed and signal detected using anti-digoxygenin- 

10 rhodamine, followed by DAPI counterstain. Images were captured using a 
Zeiss Axioskop microscope equipped with a charge-coupled device (CCD) 
Photometries, Tucson, AZ) connected to an Apple Powennac 8100 
computer. Images were captured on 3 levels of focus, and each level was 
examined for signals using SmartCapture software (Vysis, Inc., Chicago, IL). 

15 Only nuclei with signals were counted in each level, and the number of 
signals in each cell was determined. B: FISH using a pool of PACs 90B5, 
126N20 and BAC 391 12 as a probe on the paraffin embedded sections of 
the tumour #61. Two signal nuclei are predominant. C: FISH using a pool 
of PACs 73M5 and 135E14 as a probe on the paraffin embedded sections of 

20 the tumour #61. Single signal nuclei are predominant. 
Northern Blot Analysis 

The cDNAs were labelled by random priming and hybridized to human 
multiple tissue Northern blots (Clontech, Palo Alto, CA) containing 2 pg 
polyA + RNA per lane using the protocol recommended by the manufacturer. 
25 The exposure was for 14 hr to Molecular Dynamics (Sunnyvale, CA) 

Phosphorimager screens. The I.M.A.G.E. Consortium (Lennon etal., 1996) 
cDNA clone ID 824710 and the Unigene clone A002B43 have been used as 
labelled probes in separate experiments. 
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Cleavage Analysis of 

Ubiquitin-Met-p-Galactosidase Fusion Protein 

This analysis was performed essentially as descnbed (Everett et aA, 
1997). Model fusion protein ubiquitin-Met-p-galactosidase in a pACYC184 
5 (Cm' replicon) was represented by the plasmid pACYC-Ub-Met-p-gal, a kind 
gift of R. Everett. Plasmid pRB105 containing a Saccharomyces cerevisiae 
ubiquitin-specific protease UBP2 in an IPTG-inducible pBR322 (Amp' 
replicon) was a kind gift of R. Baker, and was used as a positive control. 
The new gene USP25 was cloned from nucleotide position 203 to nucleotide 

10 position 3367 (numbering as in GenBank AF 1 3421 3) into SacUSa/l cloning 
sites of the IPTG-inducible Escherichia coli expression vector pQE30 
(Qiagen, Chatsworth, CA). The £ co// XL-1 blue cells were transformed 
using a standard rubidium chloride-heat shock method with the combination 
of pACYC-Ub-Met-p-gal and either pQE30 vector, pQE30-t;SP25, or 

15 pRB105, and each of the 3 cotransormants was selected on medium 

containing chloramphenicol (42 pg/ml) and carbenicillin (75 jjg/ml). Western 
blots were prepared by electrotransfer to a nitrocellulose membrane 
(Schleicher & Schull, Keene, NH). The |3-galactosidase-containing bands 
were detected by an anti-p-galactosidase polyclonal rabit antiserum (a kind 

20 gift of R. Everett) using an enhanced chemiluminescence (ECL) assay kit 
(rpn 2132; Amersham, Arlington Heights, IL) under conditions recommended 
by the manufacturer. 

tdentification and cloning of USP25 

25 Twelve sequenced exon-trapped products, when analysed using 

BLAST-N against public sequence databases, revealed clusters of 
overlapping cDNA clones. Sequences of our exon-trapped products matched 
exactly the sequences of the cDNAs forming contigs with a large open 
reading frame (ORF). In three cases (see Fig.2): from EST 824710 to 

30 AA209364, from AA307805 to AA081 200 and from N92952r to Z4501 0. our 
trapped exon sequences served to bridge the gaps in the gene sequence 
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using PGR and suitable restriction, ligation and chain extension techniques. 
The combined sequence (sequence ID no.5 and GenBank accession 
number AF1 3421 3) revealed a 199 bp 5'UTR, start codon, an ORF of 3165 
nucleotides encoding a protein of 1055 amino acids, a stop codon, a 3'UTR 
5 of 435 nucleotides and a polyadenylation signal. The total length (without the 
polyA) assembled is 3803 nucleotides. On multiple human tissue Northern 
blots (Fig.3) a band of 4.1 kb is visible in all 16 tissues tested (including the 
normal human lung tissue) with a varying intensity. It is most prominent in 
skeletal muscle and testis, and the latter tissue also reveals a prominent 

10 shorter hybridising transcript of 1 A kb. All tissues also show a larger weaker 
band of 4.9 kb, which could be due to an alternative polyadenylation site. 

In the course of this analysis, the whole genomic sequence of the two 
PACs (73M5 and 135E14) became publicly available by the German Human 
Genome Sequencing Consortium (EMBL accession numbers AJ010597 and 

1 5 AJ01 0598). Comparison of the genomic sequence with the overlapping 

cDNA clones and exon sequences revealed that 12 out of the 24 exons had 
been exon-trapped (hatched rectangles in Fig.2). It also became apparent 
that the region immediately preceeding the first exon of the gene, comprises 
the knovm chromosome 21 CpG island at D21S382 (also known as LL56 Not 

20 I linking clone on the Not I physical map of 21q, Ichikawa et al., 1993). 

When the deduced polypeptide sequence was compared to Swissprot 
and other public databases using BLAST-P, a clear pattern of significant 
homologies {e-^Q^ to 10'^®) to proteins across the evolutionary spectrum of 
eukaryotes was found (Fig.4): all of these proteins belonged to the 

25 superfamily of ubiquitin specific proteases (USP-s) or ubiquitin carboxy- 
terminal hydrolases (UCH-s) (Baker et aL, 1992. Swanson et al, 1996, 
Everett et aL, 1997, Wilkinson 1997. Hansen-Hagge et aL, 1998, Jensen et 
al.. 1998. Fujiwara et aL. 1998, Wilkinson and Hochstrasser 1998, The C. 
elegans Sequencing Consortium 1998). The polypeptide sequences were 

30 most highly conserved around the three domains (the Cys box, the QQD box 
and the His box, Fig.4) known to be essential for the main function of these 
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enzymes: the cleavage of ubiquitin at its carboxy terminus from extension 
proteins (ubiquitin precursors) and ubiquitinated proteins and protein 
fragments targetted for the degradation by the 26S proteasome pathway 
(Wilkinson and Hochstrasser 1998). The Cysteine residue at position 178 
5 and the Histidine residues at positions 599 and 607 (marked with an asterix 
in Fig.4), which were shown to be an absolute requirement for the function of 
USP-s and UCH-s (Amerik et al., 1997, Hansen-Hagge et ah, 1998, 
Wilkinson and Hochstrasser 1998) were found in the correct positions in the 
sequence of the new gene. Since this is a first member of the USP family 
10 known to map onto human chromosome 21 . we named this protein USP25, 
for Ubiquitin Specific Protease on Chromosome 21. 

The novel protein (USP25) cleaves ubiquitin from carboxy- 

terminal fusion proteins 

The ability of USP25 to cleave a model ubiquitin fusion protein 

15 substrate was investigated by co-expression in E Coli. The complete coding 
sequence of USP25 was cloned into a T5-driven, IPTG inducible expression 
vector (pQE30). The new gene USP 21 was cloned from nucleotide position 
203 to nucleotide position 3367 (numbering as in sequence ID no. 2 into Sac 
/Sal cloning sites of the IPTG-inducible E.co// expression vector. As a 

20 positive control, the plasmid pRB105 containing a UBP2 gene encoding a 
S.Cerevisiae ubiquitin specific protease in an IPTG inducible and Amp** 
vector was used. The XL-1 blue strain of E Co// was co-transformed with 
the plasmid containing a ubiquitin-Met-p-galactosidase model fusion protein 
in an IPTG-inducible and chloramphenicol resistant vector, in addition to 

25 either pQE30 vector, pQE30-USP25 or the positive control (pRB105). (each 
of the 3 co-transformants was selected on medium containing 
chloramphenicol (42 pg/ml) and carbenicillin (75 pg/ml). Co-transformants 
were grown to exponential phase, IPTG induced, and the crude protein 
extracts from these cultures were analysed by Western blot using an anti p- 

30 galactosidase antibody. (The western blots were prepared by electro 
transfer to a nitro cellulose membrane (Schleicher and Schuel.)). The p- 
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galactocidase containing bands were detected by an anti-p galactosidase 
polyclonal rabbit anti serum using enhanced-chemiluminescence assay kit 
(ECL, Amersham rpn2132) under conditions recommended by the 
manufacturer. 

5 As can be seen in Fig.4, the uncleaved Ub-Met-p-gal substrate (band 

labelled with an asterix in Fig.4, lane 4) converts to an 8 kDa shorter band 
(triangle in Fig, 4) in the cells co-transformed with either USP25(lanes 5,6) or 
the yeast UBP2 expressing plasmid (lanes 9,10). Constitutive expression of 
USP25 (lane 5) is quite sufficient to cleave to completeness the low levels of 

10 model substrate. The more prominent and highly induced band migrating 
slightly further in the gel than the de-ubiquitinated cleavage product is the 
truncated form of b-galactosidase expressed by the XL-1 blue bacteria 
(compare to lanes 1,2 in Fig.4). This result demonstrates that the novel gene 
product named USP25 can efficiently function as a de-ubiquitinating 

1 5 enzyme. 

From the homologies in the functional domains and from its ability to 
hydrolyse the bond between the C4erminal double glycine of ubiquitin and 
the linking methionine residue (Fig.5). it can be concluded that USP25 is a 
member of ubiquitin specific proteases. 

20 Determination of Proteins with which USP25 interacts 

Functional analysis of USP25 was performed with the aim of detecting 
the cellular proteins which interact with the USP25 protein through protein- 
protein interaction, using Yeast-Two-Hybrid (Y2H) approach. 
Saccharomyces Cerevisiae yeast has well characterised ubiquitin activating, 

25 conjugating and ligating ezymatic machinery, capable of ubiquitinating 
human proteins (Scheffner et al. 1998). A cDNA library from human brain 
cloned in "prey" vector, was co-transfected to yeast cells with USP25 cloned 
in "bait" vector. Since ubiquitin cleaving activity of USP25 was proven (Groet 
et al. 2000). this technique has a theoretical chance of detecting the natural 

30 cellular substrates for ubiquitin cleavage and de-ubiquitination by USP25. 
Since the action of ubiquitin cleavage is very rapid (Wilkinson and 
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Hochstrasser 1998). the cleavage and dissociation from its natural 
substrates for fully active USP25 could preclude the ability to detect the 
interaction through Y2H. In addition, the artificial cross-de-ubiquitination of 
yeasfs own proteins by an overexpressed USP25 could theoretically be 
5 harmful for the yeast cell and/or for the molecular interactions required for 
the Y2H. For these reasons we performed Y2H using USP25-C178A, a site 
directed mutant we recently engineered (the mutation being of the key Cys . 
residue in the Cys region) which abolishes the capacity for cleavage of 
ubiquitin by USP25. but should not interfere with the binding of USP25 to its 

10 natural ubiquitinated substrates, since this residue is conserved between all 
UCH-s and USP-s so far identified. 

Y2H experiment using USP25-C178A cloned in the yeast two hybrid 
"bait" vector pAS2 (Clontech) co-transformed into yeast cells together with a 
human adult brain cDNA library in the "prey" vector pACT2 (Clontech). 

15 Interacting events were visualized by the activation of transcription of all 
three reporter genes: Ade2, Mell and Hfs3. Interacting "prey" sequences 
were verified by PCR-sequencing on the ABI310 automated sequencer, 
using universal vector primers, and analysed by BLAST search on non- 
redundant genome and transcriptome sequence databases. The accession 

20 numbers of the sequences found to be interacting, from the GenBank 
database are given in the table. 



NSCKXID: <WO 0079267A2. 1. > 



wo 00/79267 PCT/GBOO/02446 

17 



Table 1. Summary of frequency and identities of specific interacting 
proteins from iiuman brain with USP25-C178A, detected using Yeast- 
Two-Hybrid teclinique 



Summary of Results by 
decreasing frequency of 
detection of baits 


Numoer ot speciiic 
independent clones 
"fished" by Y2H 


A«*/*aceinn nitmhor 
IIUIIIUCI 


HHR23A 


8 clones 


^^^^ ^ 

D21235 


SUMO-3 


8 clones 


NM 006937 


human UBC9 


5 clones 


U66867 


polyUbiquitin 


4 clones 


AB009010 


Ubiquitin 


3 clones 


X04803 


Ran BP2 protein 


1 clone 


NM 006267 


Various ubiquitin*tike 
precursors (1 or 2 clones 
each): 


4 clones 




Other proteins (1 or 2 

clones each) 


10 clones 





Conclusions - Interaction with HHR23A 

20 DNA repair plays a key role in prevention of carcinogenesis and 

mutagenesis. This is potentially of special significance to solid tumours 
which have exposure to UV and chemical carcinogens as the major risk 
factor, such as cancers of the skin and lung. HHR23A is a homologue of 
yeast RAD23 protein (Masutani et al. 1994), involved in DNA excision-repair 

25 after UV damage and implicated in spindle pole body duplication and cell 
cycle progression in yeast (Watkins et al. 1993, Biggins et al. 1996). Human 
homologues HHR23A and B (Masutani et al. 1994) both belong to a group of 
proteins which, when mutated, lead to Xeroderma Pigmentosum, a rare 
autosomal recessive disorder associated with a high Incidence of sunlight 

30 (UV) induced skin cancers. 

More importantly, hRAD23A has also been isolated as a primary 
interacting protein by the same Y2H technique using E6AP as a "bait" 
(Kumar et al. 1999). E6AP (Human Papilloma Virus E6 associated protein) 
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functions as one of the two so far detected ubiquitin ligases (attaching 
ubiquitin and labelling for degradation) for the master tumour suppressor p53 
(Scheffner et al. 1993). The p53 and HHR23A are the only two so far proven 
targets for this ubiquitin iigase (Kumar et al. 1999).. Since USP25 shows 
5 high rate of target preference for HHR23A (see Table 1 ), and both HHR23A 
and p53 are ubiquitinated by E6AP, it could mean that they are both de- 
ubiquitinated by USP25. The fact that Y2H with USP25 did not pick up p53 is 
understandable, because p53 is expressed in small traces (very low level) in 
normal tissues, and gets only accumulated and activated following DNA 
10 damage or other stimuli for programmed cell death (apoptosis) (Haupt et al. 
1997, Kubbutat et al. 1997. Lane 1998). Further experiments are therefore 
justified to provoke the p53 response, and monitor the effects of USP25 on 
p53 levels. 

Moreover, lack of functional E6AP accelerates the polyglutamine- 

15 induced neuronal cell death in the mouse model for the neurodegenerative 
disease Spinocerebellar-ataxia 1 (SCA1) (Cummings et al. 1999). Lack of 
E6AP gene in a mouse expressing the polyglutamine stretch mutation of 
SCA1 protein dramatically reduces the presence of ubiquitinated 
Intranuclear neuronal inclusions, but drastically accelerates the neuronal 

20 degeneration and cell death (Cummings et al. 1999). A very similar effect 
has been observed in Huntington's Disease (HD), where a dominant 
negative mutant of a ubiquitin conjugating enzyme (UBC3). when co- 
expressed in cultured neurons with the huntingtin protein bearing the 
polyglutamine extension (mutation causing HD), drastically reduces the 

25 presence of ubiquitinated intranuclear neuronal inclusions, but drastically 
accelerates the neuronal degeneration and cell death (Saudou et al. 1998). 
If USP25 de-ubiquitinates a similar set of target proteins to the ones 
ubiquitinated by E6AP. then the overexpression of USP25 may lead to 
similar effects as the inhibition of ubiquitin conjugation by E6AP. It is 

30 therefore justified to examine the effects of overexpression of USP25 (and 
other USP-s) on neuronal toxicity. 
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Interaction with human UBC9 and ubiquitin like proteins (UbL)" 

In yeast, there are 13 ubiquitin conjugating enzymes (E2 enzymes) 
(Scheffner et aL 1998). Only two (UBC3. mentioned previously in context 
with HD, and UBC9) are essential for cell cycle progression. Without UBC3 

5 the cell cycle is arrested at the transition point from G1 to S phase, whereas 
without UBC9 the cell cycle is arrested at the transition point from G2 to M 
phase (Scheffner et aL 1998). Yeast UBC9, and its mammalian homologue, 
e.g. human UBC9 have a special function among all other UBC(E2) 
enzymes, in that they are specifically not conjugating to target proteins the 

10 molecule of ubiquitin. but rather of Ubiquitin-Like small proteins (UbL) 

(Gong et al. 1997, Schwarz et al. 1998). The yeast Smt3 and human SUMO- 
1 (PIC1. Sentrin. hSmt3C), SUMO-2 (hSmtSA) and SUMO-3 (hSMT3B) 
belong to the same family of UbL proteins with approximately 50% identity 
between themselves, and some 15-30% identity and 40-60% similarity in 

15 amino acid sequence to ubiquitin (Lapenta et al. 1997, Mannen et al. 1996, 
Kamitani et al. 1998. Saitoh and Hinchey, 2000). Yeast and human UBC9 
are capable of conjugating equally yeast or human UbL-s, but not ubiquitin 
(Schwarz et al. 1998). The SUMO-1,-2 and -3 have the C-terminal glycine, 
necessary for ubiquitination of the target protein's lysine residue, but unlike 

20 ubiquitin. do not have the Lys48 residue necessary for the formation of 
polyubiquitin chains through isopeptide bonds, which are the signal for the 
proteasome degradation (Saitoh and Hinchey. 2000). Nevertheless, yeast 
Smt3 protein can rescue the mutant Mif2 phenotype, a deficient centromere 
binding protein resulting in chromosome missegregation (Meluh and 

25 Koshland 1995). SUMO-1 , as well as SUMO-3 (and probably also SUMO-2) 
are all capable of being attached by UBC9 to RanGapl , a Ran GTP-ase 
activating protein (Kamitani et al. 1998). This ATP-dependent attachment is 
essential for the binding between modified RanGapl and RanBP2 binding 
protein, in order to form functional nuclear pore complex, which controls 

30 export and import of molecules through the nuclear envelope (Mahajan et al. 
1997, Matunis et al. 1998, Lee et al. 1998). In addition, UbL small proteins 



\ISCKDCID; <WO 0079267A2 I. > 



wo 00/79267 PCT/GBOO/02446 

20 

have been shown to modify the death domains of Fas (Okura et al. 1996), 
Tumour necrosis factor receptorl (Okura et al. 1 996), PML (a tumour 
suppressor implicated in the pathogenesis of acute promyelocytic 
leukaemia) (Kamitani et al. 1998b) and Rad51/52 DNA repair proteins (Shen 
5 et al. 1996a). Their conjugating enzyme, UBC9, has been shown to interact 
by Y2H technique with RAD51/52 DNA repair proteins, and the master 
tumour suppressor p53 (Shen et al. 1996b). 

The USP25 Y2H data show clear pattern of interaction in the UBC9 
pathway. Interaction with UBC9 itself, is to our knowledge the first of the kind 

10 demonstration of a direct protein-protein interaction between a USP and a 
conjugating enzyme. Interaction with RanBP2 and SUMO-3 clearly shows 
that USP25 could be sharing the similar target repertoire as UBC9 (Saitoh et 
al. 1997). USP25 may be removing the SUMO (UbL) molecules attached to 
targets by UBC9. Alternatively. USP25 may be preparing the UbL-s for 

15 attachment by UBC9 to targets, by removing the oligopeptide extensions 
after the C-terminal Gly-Gly group from the UbL-s. The only ubiquitin 
protease found to be essential for yeast ceil cycle progression, Uipl , was 
found to be specific to Smt3 removal, and not to ubiquitin (Li and 
Hochstrasser 1999). This protease also had a completely different sequence 

20 from known USP-s and UCH-s. Its human homologues, SENP1 and SUSP-1 , 
were recently cloned (Kim et al. 2000, Gong et al. 2000a), and found to 
specifically cleave SUM0-1.-2 and -3, but not ubiquitin and NEDD8, another 
UbL (Kamitani et al. 1997). The first human enzyme with classical USP 
stmcture (Cys, His domains) for which dual specificity to both ubiquitin and 

25 ubiquitin like protein was demonstrated was very recently published USP25 
on chromosome 1q21 (Gong et al. 2000b). However, opposite from SENP1 
and SUSP-1 , this enzyme cleaves ubiquitin and NeddS, but not SUMO-1,-2 
or -3 (Gong et al. 2000b). 

If USP25 cleaves the same UbL proteins to which it binds by Y2H 

30 (see Table 1 ), it would appear that it Is the first classical USP capable of 
cleaving both ubiquitin and human SUMO family, which maybe very 
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significant to its function. Moreover, it is possible that USP25 is actually 
specific for SUMO-3 rather than SUMO-1 or SUMO-2. The distinction 
between these targets would be the first of the kind, but remains to be further 
confirmed. The previously observed Interaction with hRAD23A (Table 1 ), as 
5 well as with other various precursor proteins containing the ubiquitin-like- 
domains (Table 1 ), could be linked also to its affinity to a certain type of 
ubiquitin like domain. 

Finally, and very importantly, SUMO molecules have been shown to 
attach post-translationally to p53 tumour suppressor itself, by the 
10 conjugating enzyme UBC9 (Rodrigues et al. 1999, Gostissa et al. 1999). 
This modification for SUMO-1 in vitro requires only SUMO-1, the SUMO-1 
activating enzyme and UBC9. The authors of Rodrigues et al. therefore 
conclude that SUMO-1 modification pathway acts as a potential regulator of 
the p53 response and may represent a novel target for the development of 
15 therapeutically useful modulators of the p53 response (Rodrigues et al. 
1999). 

We would propose that the cleavage of ubiquitin like proteins, 
and its association with UbC9 could be an alternative pathway explaining 
USP25's role in cell cycle control, and through it, its role in control of 
20 programmed cell death, with direct implications in both tumour suppression 
and neurodegeneration. 

Figure Legends 

Figure 1. Identification of the Shared Region of Overlap 
25 (S.R.O.) for hemizygous deletions in 21q1 1-q21 in NSCLC.A: Cytogenetic 
map, Not I long range physical map (Ishikawa, et al., 1993), YAC contig 
(Nizetic, et al 1994, Shinizu et al 1995 and Bosch et al 1996), and bacterial 
contig, (Groet et al 1998) are shown in consecutive horizontal layers, 
respectively, above the line showing the markers used in the LOH analysis 
30 (oval symbols). Markers are named as in Genome DataBase (prefix "D21 " 
omitted). In the column under each marker an "X" (symbolizing LOH), "+" 
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(an absence of LOH) and "U" (un-informative, homozygous result) for that 
marker in the set of eleven Croatian Tumour Bank (CTB) tumours, or in 
individual tumours #47 and #61 are shown. NT=not tested. For comparison 
with our data, markers used as probes on the genomic Southern blot of the- 
5 NSCLC cell line, and/or in LOH analysis of fresh tumours in the study by 
Kohno and co-workers (Kohno et al 1998) are indicated above the empty bar 
symbolizing the homozygous deletion they found. In our data, hatched bars 
indicate hemizygous deletions, and black filled bars indicate segments 
showing absence of LOH or deletions. Squared symbols "X" and stand 

10 for predominantly single and predominantly double signal, respectively, 
detected by FISH on interphase nuclei of the paraffin embedded sections of 
the tumour #61, when PAC clones named and indicated as bold lines in the 
PAC contig above the markers line, were used as probes. 

Figure 2. Trapped exons (hatched rectangles) and exons 

15 deduced from overlapping sequence analysis (white rectangles) defining the 
exon-intron structure of the new gene USP25. Top half shows two PACs 
73M5 and 135E14, also used as FISH probes in Fig. 1 , which were the 
source of genomic DNA for exon trapping. Exon locations on the PACs are 
shown with vertical bars, and the 50 kbp scale bar refers to this part. Bottom 

20 half consists of overlapping cDNA fragments corresponding to exons above 
them, drawn in the same scale, (500 bp scale bar is shown). Names of 
cDNA clones are as in dB-EST and UniGene databases, 824710 is the 
address of the clone in the IMAGE Consortium collection. The complete 
cDNA sequence for the whole gene is the new GenBank entry with the 

25 accession number AF 1 342 1 3. 

Figure 3. Comparison of protein sequences of USP25 to 
other eukaryotic members of the superfamily of USP-s. The protein BAP-1 is 
actually from the family of Ubiquitin C-tenninal Hydrolases, a distinct sub- 
family of this superfamily, showing homology only in the single key 

30 aminoacids in the Cys and His domains. Two reports show the localisations 
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of the highly homologous sequences for the HAUSP gene to 3p21 (KashuBa, 
et al 1997) and 16p13 (Robinson, et a/ 1998), respectively. 

Figure 4. Demonstration of the de-ubiquitinating activity of 
USP25 on a model ubiquitin fusion protein. Western blot of an SDS-PAGE 
5 was detected using an anti-p-galactosidase antiserum. Lanes 1,2: the E. coli 
XL-1 blue cells alone (in all cases second line of the pair is +1PTG). Lanes 
3,4: same cells co4ransfected with the model fusion protein encoding 
plasmid pACYC-UB-Met-|3-galactosidase protein, band labelled with an 
asterix). Lanes 5,6: as lanes 3,4 except pQE30-USP25 (full length USP25 
10 gene cloned in the pQE30 expression vector) was added instead of pQE30. 
Lanes 7,8: same as lanes 3,4 except pRBIOS (yeast de-ubiquitinating 
enzyme UBP2) was transfected instead of pQE30. Lanes 9,10: over- 
exposure of lanes 7,8. Note the presence of the 8kDa shorter, de- 
ubiquitinated Met-p-galactosidase (band labelled with a triangle). 

15 
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CLAIMS 

1 . Use of the product of the USP25 gene located at human 
chromosome 21 q 1 1-21 or a non-human homologue thereof or a functional 
fragment of the product manufacture of a medicament for use in the 

5 diagnosis, prophylaxis or treatment of cancer, 

2. Use according to claim 1 in which the cancer Is a solid 
tumour, for instance non-small cell lung carcinoma or skin cancer. 

3. Use according to claim 1 or claim 2 in which the product is 
obtained from a microoganism or cell line in which the gene as recombinant 

10 DNA has been introduced in a form in which it is replicated upon cell 
division, and is transcribed and translated. 

4. Use according to claim 3 in which the recombinant DNA is 
incorporated in a vector which comprises at least a portion of sequence I.D.6 
comprising residues 199 to 3367, optionally including an additional 96 b.p. 

15 exon inserted after base 2356, and optionally including 5' and/or 3* 
untranslated regions (UTR's). 

5. Use of a protein product having Cys, QQD and His domains 
specified in sequence IDs numbers 2. 3 and 4, respectively, in the 
manufacture of a medicament for use in the diagnosis, prophylaxis or 

20 treatment of cancer. 

6. Use according to claim 5 in which the protein has sequence 
ID. No. 1. 

7. Use according to claim 5 in which the protein has sequence 
I.D. No. 5. 

25 8. An in vitro method in which mammalian cells are cultured in 

the presence of a protein product having Cys, QQD and His domains 
specified in sequence IDs numbers 2, 3 and 4, respectively, and the effect of 
protein on cell growth, cell growth arrest and/or apoptosis is assessed. 
9. A method according to claim 8 in which the protein has 

30 sequence ID. 1 or sequence I.D.5. 
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10. Use of DNA including the gene located at human 
chromosome 21 q 1 1-21 (USP25) or a fragment thereof encoding a functional 
USP or UbLSP product in the manufacture of a medicament for use in the 
prophylaxis or treatment of cancer. 
5 11. Pharmaceutical composition comprising a gene therapy 

vehicle and DNA including the gene located at human chromosome 21q 11- 
21 (USP25) or a fragment thereof encoding a functional USP or UbLSP 
product In transcribable form. 

10 
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Sequence Listings 



5 



oEQ ID 1 protein: 

10 MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELAV 
AFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKDDL 
QRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVWRD 
SRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKPPS 
NAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDAFK 

15 SNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFYGRF 
LAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAIVIIEGEIESLHSEN 
SGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLYLDR 
YMHRNREITRIKREEIKRLKDYLTS/LQQRLERYLSYGSGPKRFPLVDVLQYA 
LEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELPSTSPS 

20 SVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEELSVLESCLHRWRTEIE 
NDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANAGHYWAY 
IFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLMYINDKAQFL 
IQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEWDAQLAQKALQ 
EKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKHLKEETIQIITKASH 

25 EHEDKSPETVLQSAIKLEYARLVKLAQEDTPPETDYRLHHVWYFIQNQAPK 
KIIEKTLLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQD 
YRKFRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELI 
SHYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEEK 
DILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEPPKL 

30 PSYSTHELCERFARIMLSLSRTPADGR 
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SEQ ID no 2: (polypeptide) 
GLKNVGNTCWFSAVIQSL 

SEQ ID no 3: (polypeptide) 
5 QQDVSEFTHKLLDWLED 

SEQ ID no 4: (polypeptide) 
YRLHAVLVHEGQANAGHYWAY 

10 SEQ ID no 5: (polypeptide) 

MTVEQNVLQQSAAQKHQQTFLNQLREITGINDTQILQQALKDSNGNLELA 
VAFLTAKNAKTPQQEETTYYQTALPGNDRYISVGSQADTNVIDLTGDDKD 
DLQRAIALSLAESNRAFRETGITDEEQAISRVLEASIAENKACLKRTPTEVW 
RDSRNPYDRKRQDKAPVGLKNVGNTCWFSAVIQSLFNLLEFRRLVLNYKP 

15 PSNAQDLPRNQKEHRNLPFMRELRYLFALLVGTKRKYVDPSRAVEILKDA 
FKSNDSQQQDVSEFTHKLLDWLEDAFQMKAEEETDEEKPKNPMVELFY 
GRFLAVGVLEGKKFENTEMFGQYPLQVNGFKDLHECLEAAMIEGEIESLH 
SENSGKSGQEHWFTELPPVLTFELSRFEFNQALGRPEKIHNKLEFPQVLY 
LDRYMHRNREITRIKREEIKRLKDYLTVLQQRLERYLSYGSGPKRFPLVDV 

20 LQYALEFASSKPVCTSPVDDIDASSPPSGSIPSQTLPSTTEQQGALSSELP 
STSPSSVAAISSRSVIHKPFTQSRIPPDLPMHPAPRHITEEKLSVLESCLHR 
WRTEIENDTRDLQESISRIHRTIELMYSDKSMIQVPYRLHAVLVHEGQANA 
GHYWAYIFDHRESRWMKYNDIAVTKSSWEELVRDSFGGYRNASAYCLM 
YINDKAQFLIQEEFNKETGQPLVGIETLPPDLRDFVEEDNQRFEKELEEW 

25 DAQLAQKALQEKLLASQKLRESETSVTTAQAAGDPEYLEQPSRSDFSKH 
LKEETIQIITKASHEHEDKSPE7VLQSIMMTPNMQGIIMAIGKSRSVYDRCG 
PEAGFFKAIKLEYARLVKLAQEDTPPETDYRLHHWVYFIQNQAPKKIIEKT 
LLEQFGDRNLSFDERCHNIMKVAQAKLEMIKPEEVNLEEYEEWHQDYRK 
FRETTMYLIIGLENFQRESYIDSLLFLICAYQNNKELLSKGLYRGHDEELIS 

30 HYRRECLLKLNEQAAELFESGEDREVNNGLIIMNEFIVPFLPLLLVDEMEE 
KDILAVEDMRNRWCSYLGQEMEPHLQEKLTDFLPKLLDCSMEIKSFHEP 



NSDOCID: <W0, 



.0079267A2_L> 



wo 00/79267 PCT/GBOO/02446 



PKLPSYSTHELCERFARIMLSLSRTPADGR 



5 SEQ ID no 6: 
ds DNA 

>BASECOU^f^ 1223 a 752 c 841g 987 1 
>ORIGIN 

> 1 acagtcggcg tttcgccgcc tgcccgcggt gcccgcgcac gcoggccgcc atcgccttcg 
10 > 61 cgcclggctg gcgggggcgc tgtcctccca ggccgtccgc gccgctccct ggagctcggc 

> 121 ggagcgcggc agccagggcc ggcggaggcg cgaggagccg ggcgccaccg ccgccgccgc 

> 181 cgccgccgcc gcgggggcca tgaccgtgga gcagaacgtg ctgcagcaga gcgcggcgca 

> 241 gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca 

> 301 gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 

15 > 361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 

> 421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 

> 481 cadggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 

> 541 cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 

> 601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 
20 > 661 ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 

> 721 tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag 

> 781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 

> 841 ggaacalcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 

> 901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 
25 > 961 atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 

> 1 021 agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 

> 1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 

> 1141 tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca 

> 1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc 
30 > 1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 

> 1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 

> 1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 

> 1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 

> 1501 aagatattta agctatggtt cxggtcccaa acgattcccc ttggtagatg ttcttcagta 
35 > 1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcacttct cctgttgacg atattgacgc 

> 1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 

> 1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 

> 1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 
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1801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 
1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 
1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 
1981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 
2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 
2101 agagctaglg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 
2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 
2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 

> 2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 
10 > 2341 aaagctttla gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 

> 2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 

> 2461 agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 

> 2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 

> 2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 
15 > 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 

2701 gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 
2761 aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 



35 



> 



> 
> 



> 2821 cagggaaaca actatglatc tcataattgg gctagaaaat tltcaaagag aaagttatat 



> 



2881 agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 
20 > 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 

> 3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 

> 3061 tttgattatc atgaatgagt ttattgtccc atttltgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 

> 31 81 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 
25 > 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 

3301 tgaactclgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 



> 



> 3361 aagataaact gcacactttc cdgaacaca ctgtataaac tctttttagt tcttaaccct 



> 
> 



3421 tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta 
3481 ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 
30 > 354 1 aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg 
3601 attttattat ggcaaaacta tgcttttgcc accttcclgt tgcagtatta ctttgctttt 
3661 atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt 
3721 aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat 
3781 aaagttttat ttatggctac eta 



> 



SEQ ID no 7 dsDNA 

> ca tgaccgtgga gcagaacgtg dgcagcaga gcgcggcgca 

> 241 gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca 
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> 
> 



> 
> 



301 gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 
361 tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 
421 tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 
481 cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 
541 cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 
601 agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 
661 ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 

> 721 tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcltt tggaatttag 

> 781 aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 
10 > 841 ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 

> 901 taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 

> 961 atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 

> 1021 agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 

> 1081 catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 
15 > 1141 tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca 

1201 tgagtgccta gaagctgcaa tgattgaagg agaaattgag tclttacatt cagagaattc 
1261 aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 
1321 gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 
1381 atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 
20 > 1441 taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 
1501 aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta 
1561 tgcattggaa tttgcctcaa gtaaacctgt ttgcadtct cctgttgacg alattgacgc 

> 1621 tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 

> 1681 gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 
25 > 1741 gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 

> 1801 tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 

> 1861 ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 
1921 tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 
1 981 tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 

30 > 2041 tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 
2101 agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 
2161 cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 

> 2221 ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 

> 2281 acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 
35 > 2341 aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 

> 2401 agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 

> 2461 agaagaaad attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 

> 2521 aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggcccaaga 
2581 agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctad ttatccagaa 
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> 2641 ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 

> 2701 gagttttgat gaaaggtgtc acaacataat gaaagtlgct caagccaaac tggaaatgat 

> 2761 aaaacctgaa gaagtaaact tggaggaala tgaggagtgg catcaggatt ataggaaatt 

> 2821 cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 
5 > 2881 agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 

> 2941 cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 

> 3001 attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 

> 3061 tttgattatc algaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 

> 3121 ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 
10 > 3181 tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 

> 3241 ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccltcat attccacgca 

> 3301 tgaactctgt gagcgatttg cccgaalcat gttgtccctc agtcgaactc ctgclgatgg 

> 3361 aagataa 
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<210> 1 
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<212> PRT 

<213> Homo sapiens 

<400> 1 

Met Thr Val Glu Gin Asn Val Leu Gin Gin Ser Ala Ala Gin Lys His 
15 10 15 

Gin Gin Thr Phe Leu Asn Gin Leu Arg Glu He Thr Gly He Asn Asp 
20 25 30 

Thr Gin He Leu Gin Gin Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu 
35 40 45 

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gin Gin 
50 55 60 

Glu Glu Thr Thr Tyr Tyr Gin Thr Ala Leu Pro Gly Asn Asp Arg Tyr 
65 70 75 80 

He Ser Val Gly Ser Gin Ala Asp Thr Asn Val He Asp Leu Thr Gly 
85 90 95 

Asp Asp Lys Asp Asp Leu Gin Arg Ala He Ala Leu Ser Leu Ala Glu 
100 105 110 
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Ser Asn Arg Ala Phe Arg Glu Thr Gly lie Thr Asp Glu Glu Gin Ala 
115 120 125 - - 

lie Ser Arg Val Leu Glu Ala Ser lie Ala Glu Asn Lys Ala Cys Leu 
130 135 140 

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp 
145 150 155 160 

Arg Lys Arg Gin Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn 
165 170 175 

Thr Cys Trp Phe Ser Ala Val He Gin Ser Leu Phe Asn Leu Leu Glu 
180 185 190 

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro Ser Asn Ala Gin Asp 
195 200 205 

Leu Pro Arg Asn Gin Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu 
210 215 220 

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val 
225 230 235 240 

Asp Pre Ser Arg Ala Val Glu He Leu Lys Asp Ala Phe Lys Ser Asn 
245 250 255 

Asp Ser Gin Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp 
260 265 270 

Trp Leu Glu Asp Ala Phe Gin Met Lys Ala Glu Glu Glu Thr Asp Glu 
275 280 285 

Glu Lys Pre Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu 
290 295 300 

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe 
305 310 315 320 

Gly Gin Tyr Pro Leu Gin Val Asn Gly, Phe Lys Asp Leu His Glu Cys 
325 330 335 

Leu Glu Ala Ala Met He Glu Gly Glu He Glu Ser Leu His Ser Glu 
340 345 3*50 

Asn Ser Gly Lys Ser Gly Gin Glu His Trp Phe Thr Glu Leu Pro Pro 
355 360 365 
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Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gin Ala Leu Gly 
370 375 380 ' _ 

Arg Pro Glu Lys lie His Asn Lys Leu Glu Phe Pro Gin Val Leu Tyr 
385 390 395 400 

Leu Asp Arg Tyr Met His Arg Asn Arg Glu He Thr Arg He Lys Arg 
405 410 415 

Glu Glu He Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gin Gin Arg 
420 425 430 

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu 
435 440 445 

Val Asp Val Leu Gin Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val 
450 455 460 

Cys Thr Ser Pro Val Asp Asp He Asp Ala Ser Ser Pro Pro Ser Gly 
465 470 475 480 

Ser He Pro Ser Gin Thr Leu Pro Ser Thr Thr Glu Gin Gin Gly Ala 
485 490 495 

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala He 
500 505 510 

Ser Ser Arg Ser Val He His Lys Pro Phe Thr Gin Ser Arg He Pro 
515 520 525 

Pro Asp Leu Pro Met His Pro Ala Pro Arg His He Thr Glu Glu Glu 
530 535 540 

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu He Glu 
545 550 555 560 

Asn Asp Thr Arg Asp Leu Gin Glu Ser He Ser Arg He His Arg Thr 
565 570 575 

He Glu Leu Met Tyr Ser Asp Lys Ser Met He Gin Val Pro Tyr Arg 
580 585 590 

Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly His Tyr 
595 600 605 

Trp Ala Tyr He Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn 
610 615 620 
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Asp lie Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser 
625 630 635 640^ 

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr lie Asn 
645 650 655 

Asp Lys Ala Gin Phe Leu lie Gin Glu Glu Phe Asn Lys gIu Thr Gly 
660 665 670 

Gin Pro Leu Val Gly lie Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe 
675 680 685 

Val Glu Glu Asp Asn Gin Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp 
690 695 700 

Ala Gin Leu Ala Gin Lys Ala Leu Gin Glu Lys Leu Leu Ala Ser Gin 
705 710 715 720 

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Gly 
725 730 735 

Asp Pro Glu Tyr Leu Glu Gin Pro Ser Arg Ser Asp Phe Ser Lys His 
740 745 750 

Leu Lys Glu Glu Thr lie Gin lie lie Thr Lys Ala Ser His Glu His 
755 760 765 

Glu Asp Lys Ser Pro Glu Thr Val' Leu Gin Ser Ala He Lys Leu Glu 
770 775 780 

Tyr Ala Arg Leu Val Lys Leu Ala Gin Glu Asp Thr Pro Pro Glu Thr 
785 790 795 800 

Asp Tyr Arg Leu His His Val Val Val Tyr Phe He Gin Asn Gin Ala 
805 810 815 

Pro Lys Lys He He Glu Lys Thr Leu Leu Glu Gin Phe Gly Asp Arg 
820 825 830 

Asn Leu Ser Phe Asp Glu Arg Cys His Asn He Met Lys Val Ala Gin 
835 840 845 

Ala Lys Leu Glu Met He Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr 
850 855 860 

Glu Glu Trp His Gin Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr 
865 870 875 880 
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Leu lie lie Gly Leu Glu Asn Phe Gin Arg Glu Ser Tyr He Asp Ser 
885 890 895 

Leu Leu Phe Leu He Cys Ala Tyr Gin Asn Asn Lys Glu Leu Leu Ser 
900 905 910 

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu He Ser His Tyr Arg 
915 920 925 

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gin Ala Ala Glu Leu Phe Glu 
930 935 940 

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu He He Met Asn Glu 
945 950 955 960 

Phe He Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu 
965 970 975 

Lys Asp He Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr 
980 985 990 

Leu Gly Gin Glu Met Glu Pro His Leu Gin Glu Lys Leu Thr Asp Phe 
995 1000 1005 

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu He Lys Ser Phe His Glu 
1010 1015 1020 

Pro Pro Lys Leu Pro Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe 
1025 1030 1035 1040 

Ala Arg He Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg 
1045 1050 1055 



<210> 2 

<211> 18 
<212> PRT 

<213> Homo sapiens 

<400> 2 

Gly Leu Lys Asn Val Gly Asn Thr Cys Trp Phe Ser Ala Val He Gin 
1-5 10 15 

Ser Leu 
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<210> 3 
<211> 17 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp Trp Leu Glu 
15 10 15 

Asp 



<210> 4 

<211> 21 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Tyr Arg Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly 
15 10 15 

His Tyr Trp Ala Tyr 
20 



<210> 5 

<211> 1087 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Met Thr Val Glu Gin Asn Val Leu Gin Gin Ser Ala Ala Gin Lys His 
15 10 15 

Gin Gin Thr Phe Leu Asn Gin Leu Arg Glu lie Thr Gly lie Asn Asp 
20 25 30 

Thr Gin lie Leu Gin Gin Ala Leu Lys Asp Ser Asn Gly Asn Leu Glu 
35 40 45 

Leu Ala Val Ala Phe Leu Thr Ala Lys Asn Ala Lys Thr Pro Gin Gin 
50 55 60 

Glu Glu Thr Thr Tyr Tyr Gin Thr Ala Leu Pro Gly Asn Asp Arg Tyr 
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65 70 75 80 

He Ser Val Gly Ser Gin Ala Asp Thr Asn Val He Asp Leu Thr Gly 
85 90 95 

Asp Asp Lys Asp Asp Leu Gin Arg Ala He Ala Leu Ser Leu Ala Glu 
100 105 110 

Ser Asn Arg Ala Phe Arg Glu Thr Gly He Thr Asp Glu Glu Gin Ala 
115 120 125 

He Ser Arg- Val Leu Glu Ala Ser He Ala Glu Asn Lys Ala Cys Leu 
130 135 140 

Lys Arg Thr Pro Thr Glu Val Trp Arg Asp Ser Arg Asn Pro Tyr Asp 
145 150 155 160 

Arg Lys Arg Gin Asp Lys Ala Pro Val Gly Leu Lys Asn Val Gly Asn 
165 170 175 

Thr Cys Trp Phe Ser Ala Val He Gin Ser Leu Phe Asn Leu Leu Glu 
180 185 190 

Phe Arg Arg Leu Val Leu Asn Tyr Lys Pro Pro Ser Asn Ala Gin Asp 
195 200 205 

Leu Pro Arg Asn Gin Lys Glu His Arg Asn Leu Pro Phe Met Arg Glu 
210 215 220 

Leu Arg Tyr Leu Phe Ala Leu Leu Val Gly Thr Lys Arg Lys Tyr Val 
225 230 235 240 

Asp Pro Ser Arg Ala Val Glu He Leu Lys Asp Ala Phe Lys Ser Asn 
245 250 255 

Asp Ser Gin Gin Gin Asp Val Ser Glu Phe Thr His Lys Leu Leu Asp 
260 265 270 

Trp Leu Glu Asp Ala Phe Gin Met Lys Ala Glu Glu Glu Thr Asp Glu 
275 280 285 

Glu Lys Pro Lys Asn Pro Met Val Glu Leu Phe Tyr Gly Arg Phe Leu 
290 295 300 

Ala Val Gly Val Leu Glu Gly Lys Lys Phe Glu Asn Thr Glu Met Phe 
305 310 315 320 

Gly Gin Tyr Pro Leu Gin Val Asn Gly Phe Lys Asp Leu His Glu Cys 
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Leu Glu Ala Ala Met lie Glu Gly Glu lie Glu Ser Leu His Ser Glu 
340 345 350 

Asn Ser Gly Lys Ser Gly Gin Glu His Trp Phe Thr Glu Leu Pro Pro 
355 360 365 

Val Leu Thr Phe Glu Leu Ser Arg Phe Glu Phe Asn Gin Ala Leu Gly 
370 375 380 

Arg Pro Glu Lys lie His Asn Lys Leu Glu Phe Pro Gin Val Leu Tyr 
385 390 395 400 

Leu Asp Arg Tyr Met His Arg Asn Arg Glu lie Thr Arg lie Lys Arg 
405 410 415 

Glu Glu lie Lys Arg Leu Lys Asp Tyr Leu Thr Val Leu Gin Gin Arg 
420 425 430 

Leu Glu Arg Tyr Leu Ser Tyr Gly Ser Gly Pro Lys Arg Phe Pro Leu 
435 440 445 

Val Asp Val Leu Gin Tyr Ala Leu Glu Phe Ala Ser Ser Lys Pro Val 
450 455 460 

Cys Thr Ser Pro Val Asp Asp lie Asp ;U.a Ser Ser Pro Pro Ser Gly 
465 470 475 480 

Ser He Pro Ser Gin Thr Leu Pro Ser Thr Thr Glu Gin Gin Gly Ala 
485 490 495 

Leu Ser Ser Glu Leu Pro Ser Thr Ser Pro Ser Ser Val Ala Ala He 
500 505 510 

Ser Ser Arg Ser Val He His Lys Pro Phe Thr Gin Ser Arg He Pro 
515 520 525 

Pro Asp Leu Pro Met His Pro Ala Pro Arg His He Thr Glu Glu Lys 
530 535 540 

Leu Ser Val Leu Glu Ser Cys Leu His Arg Trp Arg Thr Glu He Glu 
545 550 555 560 

Asn Asp Thr Arg Asp Leu Gin Glu Ser He Ser Arg He His Arg Thr 
565 570 575 

He Glu Leu Met Tyr Ser Asp Lys Ser Met He Gin Val Pro Tyr Arg 
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580 585 590 

Leu His Ala Val Leu Val His Glu Gly Gin Ala Asn Ala Gly His Tyr 
595 600 605 

Trp Ala Tyr lie Phe Asp His Arg Glu Ser Arg Trp Met Lys Tyr Asn 
610 615 620 

Asp lie Ala Val Thr Lys Ser Ser Trp Glu Glu Leu Val Arg Asp Ser 
625 630 635 640 

Phe Gly Gly Tyr Arg Asn Ala Ser Ala Tyr Cys Leu Met Tyr lie Asn 
645 650 655 

Asp Lys Ala Gin Phe Leu lie Gin Glu Glu Phe Asn Lys Glu Thr Gly 
660 665 670 

Gin Pro Leu Val Gly lie Glu Thr Leu Pro Pro Asp Leu Arg Asp Phe 
675 680 685 

Val Glu Glu Asp Asn Gin Arg Phe Glu Lys Glu Leu Glu Glu Trp Asp 
690 695 700 

Ala Gin Leu Ala Gin Lys Ala Leu Gin Glu Lys Leu Leu Ala Ser Gin 
705 710 715 720 

Lys Leu Arg Glu Ser Glu Thr Ser Val Thr Thr Ala Gin Ala Ala Gly 
725 730 735 

Asp Pro Glu Tyr Leu Glu Gin Pro Ser Arg Ser Asp Phe Ser Lys His 
740 745 750 

Leu Lys Glu Glu Thr He Gin He He Thr Lys Ala Sets His Glu His 
755 760 765 

Glu Asp Lys Ser Pro Glu Thr Val Leu Gin Ser He Met Met Thr Pro 
770 775 780 

Asn Met Gin Gly He He Met Ala He Gly Lys Ser Arg Ser Val Tyr 
785 ' 790 795 800 

Asp Arg Cys Gly Pro Glu Ala Gly Phe Phe Lys Ala He Lys Leu Glu 
805 810 815 

Tyr Ala Arg Leu Val Lys Leu Ala Gin Glu Asp Thr Pro Pro Glu Thr 
820 825 830 

Asp Tyr Arg Leu His His Val Val Val Tyr Phe He Gin Asn Gin Ala 
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835 840 845 

Pro Lys Lys He He Glu Lys Thr Leu Leu Glu Gin Phe Gly Asp Arg 
850 855 860 

Asn Leu Ser Phe Asp Glu Arg Cys His Asn He Met Lys Val Ala Gin 
865 870 875 880 

Ala Lys Leu Glu Met He Lys Pro Glu Glu Val Asn Leu Glu Glu Tyr 
885 890 895 

Glu Glu Trp His Gin Asp Tyr Arg Lys Phe Arg Glu Thr Thr Met Tyr 
90C 905 910 

Leu He He Gly Leu Glu Asn Phe Gin Arg Glu Ser Tyr He Asp Ser 
915 920 925 

Leu Leu Phe Leu lie Cys Ala Tyr Gin Asn Asn Lys Glu Leu Leu Ser 
930 935 940 

Lys Gly Leu Tyr Arg Gly His Asp Glu Glu Leu He Ser His Tyr Arg 
945 950 955 960 

Arg Glu Cys Leu Leu Lys Leu Asn Glu Gin Ala Ala Glu Leu Phe Glu 
965 970 975 

Ser Gly Glu Asp Arg Glu Val Asn Asn Gly Leu He He Met Asn Glu 
980 985 990 

Phe He Val Pro Phe Leu Pro Leu Leu Leu Val Asp Glu Met Glu Glu 
995 1000 1005 

Lys Asp He Leu Ala Val Glu Asp Met Arg Asn Arg Trp Cys Ser Tyr 
1010 1015 1020 

Leu Gly Gin Glu Met Glu Pro His Leu Gin Glu Lys Leu Thr Asp Phe 
1025 1030 1035 1040 

Leu Pro Lys Leu Leu Asp Cys Ser Met Glu He Lys Ser Phe His Glu 
1045 1050 1055 

Pro Pro Lys Leu Pre Ser Tyr Ser Thr His Glu Leu Cys Glu Arg Phe 
1060 1065 1070 

Ala Arg He Met Leu Ser Leu Ser Arg Thr Pro Ala Asp Gly Arg 
1075 1080 1085 
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<210> 6 . ^ 

<211> 3803 

<212> DNA 

<213> Homo sapiens 

<400> 6 

acagtcggcg tttcgccgcc tgcccgcggt gcccgcgcac gccggccgcc atcgccttcg 60 
cgcctggctg gcgggggcgc tgtcctccca ggccgtccgc gccgctccct ggagctcggc 120 
ggagcgcggc agccagggcc ggcggaggcg cgaggagccg ggcgccaccg ccgccgccgc 180 
cgccgccgcc gcgggggcca tgaccgtgga gcagaacgtg ctgcagcaga gcgcggcgca 240 
gaagcaccag cagacgtttt tgaatcaact gagagaaatt acggggatta atgacaccca 300 
gatactacag caagccttga aggatagtaa tggaaacttg gaattagcag tggctttcct 360 
tactgcgaag aatgctaaga cccctcagca ggaggagaca acttactacc aaacagcact 420 
tcctggcaat gatagataca tcagtgtggg aagccaagca gatacaaatg tgattgatct 480 
cactggagat gataaagatg atcttcagag agcaattgcc ttgagtttgg ccgaatcaaa 540 
cagggcattc agggagactg gaataactga tgaggaacaa gccattagca gagttcttga 600 
agccagcata gcagagaata aagcatgttt gaagaggaca cctacagaag tttggaggga 660 
ttctcgaaac ccttatgata gaaaaagaca ggacaaagct cccgttgggc taaagaatgt 720 
tggcaatact tgttggttta gtgctgttat tcagtcatta tttaatcttt tggaatttag 780 
aagattagtt ctgaattaca agcctccatc aaatgctcaa gatttacccc gaaaccaaaa 840 
ggaacatcgg aatttgcctt ttatgcgtga gctgaggtat ctatttgcac ttcttgttgg 900 
taccaaaagg aagtatgttg atccatcaag agcagttgaa attcttaagg atgctttcaa 960 
atcaaatgac tcacagcagc aagatgtgag tgagtttaca cacaaattat tagattggtt 1020 
agaagatgcc ttccaaatga aagctgaaga ggagacggat gaagagaagc caaagaaccc 1080 
catggtagag ttgttctatg gcagattcct ggctgtggga gtacttgaag gtaaaaaatt 1140 
tgaaaacact gaaatgtttg gtcagtaccc acttcaggtc aatgggttca aagatctgca 1200 
tgagtgccta gaagctgcaa tgattgaagg agaaattgag tctttacatt cagagaattc 1260 
aggaaaatca ggccaagagc attggtttac tgaattacca cctgtgttaa catttgaatt 1320 
gtcaagattt gaatttaatc aggcattggg aagaccagaa aaaattcaca acaaattaga 1380 
atttccccaa gttttatatt tggacagata catgcacaga aacagagaaa taacaagaat 1440 
taagagggaa gagatcaaga gactgaaaga ttacctcacg gtattacaac aaaggctaga 1500 
aagatattta agctatggtt ccggtcccaa acgattcccc ttggtagatg ttcttcagta 1560 
tgcattggaa tttgcctcaa gtaaacctgt t.tgcacttct cctgttgacg atattgacgc 1620 
tagttcccca cctagtggtt ccataccatc acagacatta ccaagcacaa cagaacaaca 1680 
gggagcccta tcttcagaac tgccaagcac atcaccttca tcagttgctg ccatttcatc 1740 
gagatcagta atacacaaac catttactca gtcccggata cctccagatt tgcccatgca 1800 
tccggcacca aggcacataa cggaggaaga actttctgtg ctggaaagtt gtttacatcg 1860 
ctggaggaca gaaatagaaa atgacaccag agatttgcag gaaagcatat ccagaatcca 1920 
tcgaacaatt gaattaatgt actctgacaa atctatgata caagttcctt atcgattaca 1980 
tgccgtttta gttcacgaag gccaagctaa tgctgggcac tactgggcat atatttttga 2040 
tcatcgtgaa agcagatgga tgaagtacaa tgatattgct gtgacaaaat catcatggga 2100 
agagctagtg agggactctt ttggtggtta tagaaatgcc agtgcatact gtttaatgta 2160 
cataaatgat aaggcacagt tcctaataca agaggagttt aataaagaaa ctgggcagcc 2220 
ccttgttggt atagaaacat taccaccgga tttgagagat tttgttgagg aagacaacca 2280 
acgatttgaa aaagaactag aagaatggga tgcacaactt gcccagaaag ctttgcagga 2340 
aaagctttta gcgtctcaga aattgagaga gtcagagact tctgtgacaa cagcacaagc 2400 
agcaggagac ccagaatatc tagagcagcc atcaagaagt gatttctcaa agcacttgaa 2460 
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agaagaaact attcaaataa ttaccaaggc atcacatgag catgaagata aaagtcctga 2520 

aacagttttg cagtcggcaa ttaagttgga atatgcaagg ttggttaagt tggccca§ga 2580 

agacacccca ccagaaaccg attatcgttt acatcatgta gtggtctact ttatccagaa 2640 

ccaggcacca aagaaaatta ttgagaaaac attactagaa caatttggag atagaaattt 2700 

gagttttgat gaaaggtgtc acaacataat gaaagttgct caagccaaac tggaaatgat 2760 

aaaacctgaa gaagtaaact tggaggaata tgaggagtgg catcaggatt ataggaaatt 2820 

cagggaaaca actatgtatc tcataattgg gctagaaaat tttcaaagag aaagttatat 2880 

agattccttg ctgttcctca tctgtgctta tcagaataac aaagaactct tgtctaaagg 2940 

cttatacaga ggacatgatg aagaattgat atcacattat agaagagaat gtttgctaaa 3000 

attaaatgag caagccgcag aactcttcga atctggagag gatcgagaag taaacaatgg 3060 

tttgattatc atgaatgagt ttattgtccc atttttgcca ttattactgg tggatgaaat 3120 

ggaagaaaag gatatactag ctgtagaaga tatgagaaat cgatggtgtt cctaccttgg 3180 

tcaagaaatg gaaccacacc tccaagaaaa gctgacagat tttttgccaa aactgcttga 3240 

ttgttctatg gagattaaaa gtttccatga gccaccgaag ttaccttcat attccacgca 3300 

tgaactctgt gagcgatttg cccgaatcat gttgtccctc agtcgaactc ctgctgatgg 3360 

aagataaact gcacactttc cctgaacaca ctgtataaac tctttttagt tcttaaccct 3420 

tgccttcctg tcacagggtt tgcttgttgc tgctatagtt tttaactttt ttttatttta 3480 

ataactgcaa aagacaaaat gactatacag actttagtca gactgcagac aataaagctg 3540 

aaaatcgcat ggcgctcaga cattttaacc ggaactgatg tataatcaca aatctaattg 3600 

attttattat ggcaaaacta tgcttttgcc accttcctgt tgcagtatta ctttgctttt 3660 

atcttttctt tctcaacagc tttccattca gtctggatcc ttccatgact acagccattt 3720 

aagtgttcag cactgtgtac gatacataat atttggtagc ttgtaaatga aataaagaat 3780 

aaagttttat ttatggctac eta 3803 



<210> 7 

<211> 3169 

<212> DNA 

<213> Homo sapiens 

<400> 7 

catgaccgtg gagcagaacg tgctgcagca gagcgcggcg cagaagcacc agcagacgtt 60 

tttgaatcaa ctgagagaaa ttacggggat taatgacacc cagatactac agcaagcctt 120 

gaaggatagt aatggaaact tggaattagc agtggctttc cttactgcga agaatgctaa 180 

gacccctcag caggaggaga caacttacta ccaaacagca cttcctggca atgatagata 240 

catcagtgtg ggaagccaag cagatacaaa tgtgattgat ctcactggag atgataaaga 300 

tgatcttcag agagcaattg ccttgagttt ggccgaatca aacagggcat tcagggagac 360 

tggaataact gatgaggaac aagccattag cagagttctt gaagccagca tagcagagaa 420 

taaagcatgt ttgaagagga cacctacaga agtttggagg gattctcgaa acccttatga 480 

tagaaaaaga caggacaaag ctcccgttgg gctaaagaat gttggcaata cttgttggtt 540 

tagtgctgtt attcagtcat tatttaatct tttggaattt agaagattag ttctgaatta 600 

caagcctcca tcaaatgctc aagatttacc ccgaaaccaa aaggaacatc ggaatttgcc 660 

ttttatgcgt gagctgaggt atctatttgc acttcttgtt ggtaccaaaa ggaagtatgt 720 

tgatccatca agagcagttg aaattcttaa ggatgctttc aaatcaaatg actcacagca 780 

gcaagatgtg agtgagttta cacacaaatt attagattgg ttagaagatg ccttccaaat 840 

gaaagctgaa gaggagacgg atgaagagaa gccaaagaac cccatggtag agttgttcta 900 

tggcagattc ctggctgtgg gagtacttga aggtaaaaaa tttgaaaaca ctgaaatgtt 960 

tggtcagtac ccacttcagg tcaatgggtt caaagatctg catgagtgcc tagaagctgc 1020 
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aatgattgaa ggagaaattg agtctttaca 
gcattggttt actgaattac cacctgtgtt 
tcaggcattg ggaagaccag aaaaaattca 
tttggacaga tacatgcaca gaaacagaga 
gagaccgaaa gattacctca cggtattaca 
ttccggtccc aaacgattcc ccttggtaga 
aagtaaacct gtttgcactt ctcctgttga 
ttccatacca tcacagacat taccaagcac 
actgccaagc acatcacctt catcagttgc 
accatttact cagtcccgga tacctccaga 
aacggaggaa gaactttctg tgctggaaag 
aaatgacacc agagatttgc aggaaagcat 
gtactctgac aaatctatga tacaagttcc 
aggccaagct aatgctgggc actactgggc 
gatgaagtac aatgatattg ctgtgacaaa 
ttttggtggt tatagaaatg ccagtgcata 
gttcctaata caagaggagt ttaataaaga 
attaccaccg gatttgagag attttgttga 
agaagaatgg gatgcacaac tcgcccagaa 
gaaattgaga gagtcagaga cttctgtgac 
tctagagcag ccatcaagaa gtgatttctc 
aattaccaag gcatcacatg agcatgaaga 
aattaagttg gaatatgcaa ggttggttaa 
cgattatcgt ttacatcatg tagtggtcta 
tattgagaaa acattactag aacaatttgg 
tcacaacata atgaaagttg ctcaagccaa 
cttggaggaa tatgaggagt ggcatcagga 
tctcataatt gggctagaaa attttcaaag 
catctgtgct tatcagaata acaaagaact 
tgaagaattg atatcacatt atagaagaga 
agaactcttc gaatctggag aggatcgaga 
gtttattgtc ccatttttgc cattattact 
agctgtagaa gatatgagaa atcgatggtg 
cctccaagaa aagctgacag attttttgcc 
aagtttccat gagccaccga agttaccttc 
tgcccgaatc atgttgtccc tcagtcgaac 



ttcagagaat tcaggaaaat caggccaaga 1080 
aacatttgaa ttgtcaagat ttgaatttaa 1140 
caacaaatta gaatttcccc aagttttata 1200 
aataacaaga attaagaggg aagagatcaa 1260 
acaaaggcta gaaagatatt taagctatgg 1320 
tgttcttcag tatgcattgg aatttgcctc 1380 
cgatattgac gctagttccc cacctagtgg 1440 
aacagaacaa cagggagccc tatcttcaga 1500 
tgccatttca tcgagatcag taatacacaa 1560 
tttgcccatg catccggcac caaggcacat 1620 
ttgtttacat cgctggagga cagaaataga 1680 
atccagaatc catcgaacaa ttgaattaat 1740 
ttatcgatta catgccgttt tagttcacga 1800 
atatattttt gatcatcgtg aaagcagatg 1860 
atcatcatgg gaagagctag tgagggactc 1920 
ctgtttaatg tacataaatg ataaggcaca 1980 
aactgggcag ccccttgttg gtatagaaac 2040 
ggaagacaac caacgatttg aaaaagaact 2100 
agctttgcag gaaaagcttt tagcgtctca 2160 
aacagcacaa gcagcaggag acccagaata 2220 
aaagcacttg aaagaagaaa ctattcaaat 2280 
taaaagtcct gaaacagttt tgcagtcggc 2340 
gttggcccaa gaagacaccc caccagaaac 2400 
ctttatccag aaccaggcac caaagaaaat 2460 
agatagaaat ttgagttttg atgaaaggtg 2520 
actggaaatg ataaaacctg aagaagtaaa 2580 
ttataggaaa ttcagggaaa caactatgta 2640 
agaaagttat atagattcct tgctgttcct 2700 
cttgtctaaa ggcttataca gaggacatga 2760 
atgtttgcta aaattaaatg agcaagccgc 2820 
agtaaacaat ggtttgatta tcatgaatga 2880 
ggtggatgaa atggaagaaa aggatatact 2940 
ttcctacctt ggtcaagaaa tggaaccaca 3000 
aaaactgctt gattgttcta tggagattaa 3060 
atattccacg catgaactct gtgagcgatt 3120 
tcctgctgat ggaagataa 3169 
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